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1  Introduction 


Given  a  hypergraph  H  =  (V,E),  the  set  cover  problem  consists  of  finding  a  minimum  size  subset 
R  C  V  which  covers  H ;  i.e.,  for  which  e  n  R  /  0  for  all  e  £  E.  This  is  equivalent  to  the  problem 
of,  given  a  set  system  A  C  2X ,  finding  a  minimum  subcollection  A'  C  A  such  that  (J  A'  =  X .  The 
set  cover  problem  is  NP-complete  [K],  so  we  will  not  be  concerned  with  algorithms  giving  exact 
solutions;  rather,  we  will  consider  algorithms  giving  approximate  solutions. 

The  best  known  polynomial-time  approximation  algorithm  for  set  cover  is  the  greedy  set  cover 
algorithm  [J,  Lo].  Surprisingly,  showing  that  the  greedy  algorithm  performs  well  is  fairly  challeng¬ 
ing,  as  is  evident  in  the  proofs  of  Johnson  and  Lovasz.  They  show  that  the  greedy  algorithm  always 
produces  a  cover  of  size  at  most  log  A  times  optimal,  where  A  is  the  maximum  degree  of  H  (i.e. 
the  maximum  number  of  edges  containing  any  node).  However,  the  greedy  algorithm  is  inherently 
sequential.  Although  RNC  algorithms  have  been  proposed  which  perform  well  for  some  special 
cases  [CF,  ABLP,  VL],  no  parallel  algorithm  which  performs  well  on  arbitrary  instances  has  been 
developed  until  now. 

In  this  paper,  we  give  a  linear-processor  deterministic  NC  algorithm  that  always  finds  a  cover 
which  is  within  a  log  A  factor  of  the  optimal  size.  Hence,  the  algorithm  achieves  the  same  perfor¬ 
mance  as  the  best  sequential  algorithm,  both  in  terms  of  cover  size  and  processor-time  product.  In 
addition,  the  algorithm  works  equally  well  for  weighted  hypergraphs. 

We  also  consider  a  related  problem,  which  we  call  the  balanced  set  cover  problem.  Specifically, 
given  a  hypergraph  H  =  ( V ,  E )  and  some  fixed  c,  0  <  c  <  1,  find  a  minimum  size  subset  R  C  V  such 
that  \R  H  e\  >  c|e||^j  for  all  e  £  E.  We  call  this  a  balanced  cover  since  for  each  edge,  the  fraction 
of  its  vertices  in  the  cover  is  at  least  a  constant  times  the  average  fraction.  Clearly,  balanced  set 
cover  differs  from  normal  set  cover  in  that  instead  of  covering  every  edge  once,  every  edge  is  covered 
multiple  times  in  proportion  to  its  size.  Chazelle  and  Friedman  [CF]  first  considered  this  problem 
and  developed  a  polynomial-time  approximation  algorithm  which  produces  an  R  of  size 
where  an  is  the  size  of  the  smallest  edge.  Their  algorithm  is  a  modified  version  of  the  greedy  set 
cover  algorithm  and  is  just  as  sequential  in  nature.  In  this  paper,  we  give  an  NC  algorithm  for 
balanced  set  cover,  which  uses  a  linear  number  of  processors,  and  obtains  the  same  bound  on  the 
cover  size  as  the  sequential  [CF]  algorithm. 

The  set  cover  problems  we  consider  have  several  applications  in  parallel  learning  theory  and 
computational  geometry.  Of  the  two,  learning  is  more  interesting,  because  of  all  the  fields  in  which 
people  strive  to  develop  parallel  algorithms,  learning  is  probably  one  of  the  most  natural.  In  this 
paper,  we  consider  a  fundamental  learning  problem  that  has  been  solved  in  the  sequential  domain 
[BEHW],  and  solve  it  in  parallel.  In  particular,  we  consider  the  problem  of  learning  in  concept 
classes  that  are  formed  by  taking  either  finite  unions  or  finite  intersections  of  a  fixed  base  class  of 
finite  VC  dimension.  We  show  that  classes  of  this  type  are  NC-learnable  whenever  there  is  an  NC 
algorithm  for  finding  a  consistent  hypothesis  in  the  base  class.  The  only  previous  parallel  work 
on  this  subject  is  in  [VL],  They  give  an  RNC  algorithm  for  learning  s-fold  unions  of  axis-parallel 
rectangles  in  the  plane,  by  using  a  randomized  set  cover  algorithm  for  specialized  hypergraphs.  ' 

Our  general  techniques  solve  this  problem  as  a  special  case.  In  addition,  while  [VL]  may  produce  a  - - 

hypothesis  with  up  to  s4  log  m  rectangles,  our  method  will  always  produce  one  with  at  most  s  log  m  j 

rectangles,  due  to  the  performance  guarantee  of  our  set  cover  algorithm.  j- 

In  computational  geometry,  randomization  is  frequently  used  to  construct  poly-time  algorithms _ 

[Cll,  C12,  C13,  EGS,  HW,  RSl,  RS2].  Recently,  Chazelle  and  Friedman  [CF]  showed  how  to  make  - 

most  of  the  randomized  algorithms  used  in  computational  geometry  deterministic,  albeit  at  the  _ 

cost  of  a  substantial  increase  in  running  time.  To  remove  randomness,  they  use  balanced  set  cover. 

Our  parallel  results  can  be  used  to  put  many  of  these  geometry  algorithms  into  deterministic  NC,  Codei 
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with  the  number  of  processors  required  being  equal  to  the  running  time  of  Chazelle  and  Friedman’s 
algorithm. 

The  remainder  of  this  paper  is  divided  into  sections  as  follows.  In  Section  2,  we  present  our  NC 
algorithms  for  approximate  set  cover.  In  Section  3,  we  show  how  to  use  set  cover,  as  well  as  other 
tools,  to  make  the  learning  problem  parallel.  In  Section  4,  we  outline  the  technique  of  Chazelle  and 
Friedman  and  describe  some  of  the  ideas  that  are  needed  to  parallelize  their  algorithms. 

We  discovered  the  balanced  set  cover  results  in  late-February.  Shortly  thereafter,  Motwani, 
Naor,  and  Naor  [MNN]  announced  that  they  also  could  obtain  an  NC  algorithm  for  balanced  set 
cover.  Although  we  have  not  seen  their  proof,  they  have  indicated  that  their  method  uses  (logn)- 
wise  independence  (instead  of  pairwise  independence  —  as  in  our  method),  and  thus  it  appears  to 
require  many  more  pioeessors  for  balanced  set  cover  than  our  method.  Our  stronger  results  for 
general  set  cover  and  the  applications  to  learning  were  discovered  in  mid-April,  and  do  not  seem 
to  have  been  duplicated  by  Motwani,  Naor,  and  Naor.  Nor  does  it  appear  that  these  results  can 
be  naturally  derived  from  their  methods. 

2  NC  Algorithms  for  Approximate  Set  Cover 

In  this  section,  we  consider  the  set  cover  problem:  given  a  hypergraph  H  =  (F,  E),  find  a  minimum 
size  subset  R  C  F  which  covers  II ,  i.e.  for  which  e  fl  R  ^  0  for  all  e  £  E.  As  previously  stated,  we 
will  consider  algorithms  giving  approximate  solutions.  The  best  known  approximation  algorithm, 
due  to  Johnson  and  Lovasz  [J,l  o],  is  the  greedy  set  cover  algorithm.  This  works  by  repeatedly 
picking  the  vertex  of  maximum  degree  (i.e.  the  one  contained  in  the  most  edges),  adding  it  to  the 
cover,  and  then  deleting  it  and  all  edges  containing  it  from  the  hypergrapn.  Surprisingly,  showing 
that  this  simple  greedy  algorithm  performs  well  is  fairly  challenging,  as  is  evident  in  the  proofs  of 
the  following  theorem. 

Theorem  2.1  (J,Lo)  Let  H  =  (F,  E)  be  a  hypergraph  with  maximum  degree  A.  Define  a  fractional 
cover  of  II  to  be  a  function  f  :  V  — *  TZ  such  that  0  <  f(v)  <  1  for  all  v  £  V  and  Ylvee  f(v )  -  1 
for  all  e  £  E.  (Note  that  a  normal  cover  is  a  fractional  one  where  f(v)  £  {0, 1}  for  all  v  £  V .) 
Let  t  be  the  size  of  the  optimal  cover  of  H ,  and  let  r *  be  the  size  (i.e.  YlveV  f(v))  °f  Me  optimal 
fractional  cover  of  //.  Then  the  size  of  the  cover  produced  by  the  greedy  set  cover  algorithm  is  at 
most  (In  A  +  1  )r*  <  (In  A  +  l)r. 

Remark  2.2  The  following  problems  can  easily  be  shown  to  be  equivalent  to  set  cover: 

1.  Given  a  hypergraph  H  —  (F,  E),  find  a  minimum  size  subset  of  the  edges  which  covers  F. 

2.  Given  a  bipartite  graph  G  =  (X,Y,E),  find  a  minimum  size  subset  of  X  which  dominates  Y. 

An  approximation  algorithm  for  any  one  of  these  problems  can  be  converted  to  approximation 
algorithms  for  the  others  with  the  same  performance  guarantee. 

We  will  also  consider  approximation  algorithms  for  a  variation  of  the  set  cover  problem,  which 
we  call  the  balanced  set  cover  problem:  given  a  hypergraph  H  =  (F,  E),  and  some  fixed  c,  0  <  c  <  1, 
find  a  minimum  size  subset  R  C  F  such  that  jiZ  n  e\  >  c|e||^|  for  all  e  £  E. 

In  this  section,  we  will  provide  efficient  parallel  approxjmation  algorithms  for  these  problems. 
In  Section  2.1.  we  give  an  NC  algorithm  for  approximate  balanced  set  cover,  which  produces  an  R 
of  size  0(  losHgl> ),  where  an  is  the  size  of  the  smallest  edge.  A  sequential  algorithm  achieving  the 
same  bound  was  used  in  [CF]  to  obtain  deterministic  algorithms  for  several  geometry  problems. 
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Our  result  allows  us  to  parallelize  these  problems  (see  Section  4).  In  Section  2.2,  we  give  an 
NC  algorithm  for  approximate  set  cover,  which  achieves  the  same  bound  as  the  greedy  algorithm 
(within  a  (1  +  c)  factor).  Our  algorithm  builds  upon  the  techniques  used  in  our  balanced  set  cover 
algorithm.  In  Section  2.3,  we  show  how  to  reduce  the  processor  count  of  both  these  algorithms  to 
be  linear.  Finally,  in  Section  2.4,  we  show  how  to  extend  our  set  cover  algorithm  to  handle  weighted 
hypergraphs  [C]. 

2.1  Balanced  Set  Cover 

Here  we  consider  the  balanced  set  cover  problem.  Specifically,  given  a  hypergraph  H  =  (V,  E)  with 
minimum  edge  size  an,  we  want  to  find  a  subset  R  C  V,  |/?|  <  81og(nm)/a,  such  that  for  all  e  £  E, 
|e  fl  for  some  constant  c  >  0. 

The  first  way  one  might  consider  finding  such  an  R  is  to  simply  choose  R  at  random.  It  can  be 
easily  verified  that  if  each  vertex  is  given  an  independent  — probability  of  being  included  in 
R,  then  with  high  probability  R  will  be  good. 

Another  method  for  finding  a  good  R  is  the  “modified  greedy  algorithm”  of  [CF],  which  works 
as  follows.  First  break  up  the  edges  so  that  every  edge  contains  between  an  and  2 an  vertices.  Then 
modify  the  greedy  set  cover  algorithm  by  putting  weights  on  edges.  Initially  each  edge  has  weight 
1.  The  weight  of  a  vertex  is  defined  as  the  sum  of  the  weights  of  the  edges  which  contain  it.  The 
algorithm  repeatedly  picks  the  vertex  with  maximum  weight.  Instead  of  removing  an  edge  when  it 
is  covered,  its  weight  is  cut  in  half.  Chazelle  and  Friedman  show  that  running  this  algorithm  for 
41og(nm)/a  steps  produces  a  good  R. 

Suppose  we  want  an  NC  algorithm  to  find  a  good  R.  The  first  approach  above  gives  a  natural 
RNC  algorithm,  but  needing  full  independence.  The  second  is  inherently  sequential.  We  provide  a 
hybrid  of  these  two  algorithms  which  depends  on  only  pairwise  independence,  and  then  use  Luby’s 
method  [L2]  to  get  a  deterministic  NC  algorithm.  In  this  case,  the  hard  part  is  getting  an  RNC 
algorithm  that  only  uses  pairwise  independence. 

Our  approach  first  handles  two  special  cases:  if  r  >  n,  we  let  R  =  V  and  quit;  if  a  >  1/log2  n 
(implying  r  <  Sdlog3/!),  we  run  the  “modified  greedy  algorithm”  directly.  Otherwise,  we  alter 
the  “modified  greedy  algorithm”  as  follows:  rather  than  picking  one  vertex  at  a  time,  we  pick  a 
large  set  of  vertices  which  significantly  decrease  the  total  edge  weight.  This  is  accomplished  by 
picking  vertices  with  probability  where  6  is  a  small  constant.  This  implies  that  for  each  edge, 
the  expected  number  of  vertices  picked  is  between  6  and  26.  (Actually,  we  are  ignoring  the  fact 
that  the  size  of  edges  decreases  as  they  are  hit.  However,  one  can  ensure  that  all  edges  not  hit 
enough  times  already  contain  at  least  7an/8  vertices;  thus,  the  numbers  here  are  off  by  at  most  a 
factor  of  8/7.)  Because  we  are  picking  such  a  sparse  set  of  vertices,  even  assuming  just  pairwise 
independence,  we  can  show  that  the  probability  of  covering  a  given  edge  at  least  once  is  at  least 
6  —  2 62.  So  we  expect  to  reduce  the  total  edge  weight  by  a  ^~2—  fraction,  at  least.  After  about 
41ogm/£  iterations,  we  expect  the  total  weight  to  have  dropped  from  m  to  1/m.  In  this  case,  every 
edge  has  weight  at  most  1/m,  and  thus  has  been  hit  at  least  logm  times.  So  |/?|  will  be  about 
4  ‘T™  a  >  and  \e  n  #1  wiH  at  least  logm,  which  is  at  least  |je|^. 

To  make  this  deterministic,  we  need  to  be  able  to  construct  a  set  of  about  6 /a  vertices  which 
causes  the  total  weight  to  decrease  by  about  6/2.  Since  we  krow  that  picking  pairwise  independently 
with  probability  ^  is  expected  to  work,  we  could  apply  the  techniques  of  [KW,L1 , A BI] ,  trying  every 
sample  point  of  a  small  pairwise  independent  distribution,  to  get  a  deterministic  NC  algorithm. 
Unfortunately,  this  entails  an  0(n2)  blowup  in  the  number  of  processors.  Instead,  we  will  apply 
the  techniques  of  [L2],  first  obtaining  a  blowup  of  only  0(an),  then,  in  Section  2.3,  showing  how 
to  achieve  a  linear  number  of  processors. 
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To  apply  the  techniques  of  [L2],  we  must  reduce  tiie  problem  to  finding  an  X  =  <  Aj , . . . ,  Xn  > 
with  BENEFIT(X)  >  E[BENEFIT(X)} ,  for  some  BENEFIT  function  which  is  a  sum  of  terms 
depending  on  one  or  two  random  variables  each.  Our  first  cut  at  a  BENEFIT  function  is  to  try  to 
lower  bound  the  decrease  in  total  weight.  Consider  some  edge  e  €  E  with  weight  iue.  A  good  lower 
bound  on  the  weight  removed  on  edge  e  is  ^  V.ee**>'  ie->  we/2  i f  edge  e  is  hit  and  0  otherwise. 
This  is  not  a  good  function  to  work  with,  since  to  apply  [L2]  we  must  have  a  function  which  is  a 
sum  of  terms  depending  on  one  or  two  variables  each.  However,  we  can  further  lower  bound  by  the 
first  two  terms  of  the  inclusion-exclusion  expansion, 

f  (£*<  -  £  *i*i)  =  BENEFlTe(X). 

«€e  i,j£e 

It  is  easily  verified  from  this  that  E[BENEFITe(X)}  >  %f-{6  —  262).  So 


£[^T  BENEFITe{X)\  > 

e£E 


Wfotal 


(6  -  2 62). 


Hence,  we  might  try  to  apply  T.'iby’s  technique  to  get  an  A'  such  that 

J2  BENEFIT e(X)  >  ^Sl(S  -  2 S2). 

eeE  2 

Why  isn’t  this  enough?  Because  the  X  output  may  be  1  in  significantly  more  than  6/a  places. 
In  this  case,  after  the  correct  number  of  iterations,  every  edge  would  be  hit  logm  times,  but  R 
would  be  too  large.  So,  we  must  modify  the  BENEFIT  function  so  that  this  cannot  occur.  Let 
|*|  =  E,€v*.-  We  let 

BENEFIT(X)  =  £  BENEFIT e{X)  -  a6wtotai  (|A|  -  £[|A|])2. 

e£E 

Note  that  BENEFIT(X)  is  still  a  lower  bound  on  the  weight  removed  by  picking  A'. 

We  already  argued  that  E\£eeE  BENEFIT t(X)\  >  ^f^{6  -  262).  And, 

£[(£X,-£[£A,])2]  =  £[(£A,)2-£[£At]2] 

•ev'  •ev'  iev  iev 

=  X:  £[*.2]  -  £  -  L  Eix>)2 

t,  jGV  ieV  t.jCV'  i£\ 

=  £(£t  X2]-£[X,]2) 

=  |v|(A  _  (/-¥) 

<  6/a. 

So  E[BENEFIT{X)}  >  wtotal{\ -  262). 

Lemma  2.3  BENEFIT(A)  >  0  =>  £  -  yf&  <  \X\  <  f  +  /J. 

Proof:  Assume  ||A|  -  ||  >  y/&.  Then  f  |A|  -  £[|A|])2  >  Jy.  Sc  a6wMal( \X\ -  £[|A|])2  >  wiotal 
and  BENEFIT(X)  <  0.  □ 
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Plugging  into  [L2],  we  get  an  X  with  BENEFIT(X)  >  wtota\(\  -  262).  Thus  we  can  pick  a  set 
of  size  at  most  £  +  ^1—  which  causes  the  total  weight  to  decrease  by  approximately  a^/2  fraction. 
This  allows  us  to  get  an  R  with  the  desired  properties. 

The  running  time  of  this  procedure  is  O(logm)  times  the  ©(log3?!)  time  required  for  one 
invocation  of  [L2].  The  number  of  processors  is  0(]CeeE  H2  +  n2)  =  0(anY,e€E  lel  +  n2)-  Assuming 
the  ra2  term  is  insignificant,  this  is  an  times  the  size  of  the  input,  0(^2ee£  |e|).  In  Section  2.3,  we 
show  how  to  achieve  a  linear  number  of  processors  for  this  problem. 

2.2  Set  Cover  within  log  A  of  Optimal 

Now  we  turn  our  attention  to  the  problem  of,  given  a  hypergraph  H  =  (V,E),  finding  a  cover  of 
H  containing  0(r*  log  A)  vertices,  where  A  is  the  maximum  degree  of  H  and  r*  is  the  size  of  the 
optimal  fractional  cover  of  H . 

In  the  previous  section,  since  a  random  cover  worked  with  high  probability,  to  get  an  RNC 
algorithm  using  only  pairwise  independence,  we  could  simply  pick  a  sequence  of  random  sparse 
subsets  and  use  the  metric  from  the  greedy  algorithm  to  ensure  each  one  contributed  to  a  good 
cover.  Here,  however,  since  a  random  cover  is  not  likely  to  be  good,  the  behavior  of  our  algorithm 
must  more  closely  follow  the  behavior  of  the  greedy  algorithm. 

To  find  a  near-optimal  cover,  we  proceed  as  follows.  Let  e  be  a  constant,  0  <  e  <  1/12.  We  can 
cover  H  in  a  series  of  phases.  At  the  beginning  of  phase  i  (phases  will  be  sequenced  in  decreasing 
order),  the  hypergraph  induced  by  the  algorithm  thus  far  will  have  maximum  degree  less  than 
(1  +  ()'.  During  this  phase,  we  will  restrict  our  attention  to  the  subhypergraph  induced  by  the 
vertices  of  degree  between  (14-  c)‘-1  and  (1  +  ()'  and  will  only  add  these  vertices  to  the  cover. 
In  this  way  we  only  add  to  the  cover  vertices  that  have  degrees  close  to  the  maximum,  thereby 
emulating  the  greedy  algorithm. 

In  the  previous  section,  since  the  edges  were  guaranteed  to  range  in  size  between  an  and  2cm 
vertices,  we  were  able  to  pick  vertices  pairwise  independently  with  probability  to  get  a  good  set. 
Here,  however,  the  edge  sizes  can  vary  arbitrarily,  making  the  task  of  getting  a  single  probability 
impossible  If  we  set  it  too  low.  then  t.h«  small  edges  will  not  be  hit  quickly  enough.  If  we  set  it  too 
high,  then  the  large  edges  will  be  hit  many  times;  this  is  undesirable  since  the  average  number  of 
edges  covered  by  each  vertex  could  become  much  smaller  than  the  maximum  degree,  thus  deviating 
from  the  behavior  of  the  greedy  algorithm.  Our  solution  is  to  perform  a  sequence  of  subphases. 
At  the  beginning  of  subphase  j  (again  sequenced  in  decreasing  order),  all  edges  will  contain  fewer 
than  (1  +  e)J  vertices  (which  were  restricted  above  to  have  degrees  between  (1  4-e)*-1  and  (1  +c)’). 
During  this  subphase,  we  will  repeatedly  pick  vertices  with  probability  where  0  <  6  <  1/12. 

Picking  vertices  with  this  probability  will  allow  us  to  cover  a  6/2  fraction  of  the  edges  of  size  at 
least  (1  4-  c)J_1,  but  will  not  cause  many  edges  to  be  hit  more  than  once,  since  no  edge  is  larger 
than  (1  4-  e)J. 

More  precisely,  a  subphase  consists  of  a  series  of  selection  steps,  performed  until  there  are 
no  more  edges  of  size  at  least  (14-  c)J_1.  The  selection  steps  are  of  two  types.  If  one  or  more 
vertices  cover  a  63/(  1  4-  e)  fraction  of  these  large  edges,  we  select  one  such  vertex.  Otherwise,  we 
run  a  selection  procedure  which  produces  a  collection  P  of  vertices  which  cover  at  least  |P|(1  4- 
f),-1(l  —  6 6  —  2e6)  edges,  including  at  least  a  6/2  fraction  of  the  large  edges.  (Later  we  will  give 
both  randomized  and  deterministic  methods  for  performing  this  selection  step.)  In  either  case, 
the  selected  vertex  or  vertices  are  added  to  the  cover  and  deleted,  the  edges  covered  by  these  are 
deleted,  and  vertices  which  now  have  degree  less  than  (14-  c)’-1  are  removed  from  consideration 
for  this  phase. 
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Theorem  2.4  The  number  of  selection  steps  is  O(log2  nlogm). 

Proof:  Clearly  the  number  of  subphases  is  (3(log  n  log  m/ log2(  1  +  c))  =  O(log  n  log  m).  Within 
each  subphase,  each  selection  step  removes  at  least  a  63/(  1  +  c)  fraction  of  P}.  Thus,  there  are 
O(logn)  selection  steps  within  each  subphase.  □ 

Theorem  2.5  The  number  of  vertices  in  the  cover  produced  by  the  above  algorithms  is  at  most 
j^^OogAV,  where  A  is  the  maximum  degree  of  any  vertex ,  and  r*  is  the  size  of  the  minimum 
fractional  cover. 

Proof  Sketch:  Consider  a  selection  step  which  adds  k  vertices  to  the  cover.  These  vertices  will 
cover  a  combined  total  of  at  least  edges,  where  d  is  the  current  maximum  degree  of  any 

vertex.  This  is  at  worst  a  — ~61'5f~—  factor  lower  than  what  the  greedy  algorithm  could  attain.  So, 
intuitively,  one  would  expect  to  do  no  more  than  a  factor  worse  than  the  greedy  algorithm. 

In  fact,  the  proof  of  Lovasz  [Lo]  can  be  modified  to  give  the  theorem.  The  details  will  appear  in  a 
later  version  of  this  paper.  □ 

Now  we  give  randomized  and  deterministic  versions  of  the  selection  procedure.  Let  H,  =  (VJ,  £,) 
be  the  current  hypergraph,  restricted  to  vertices  of  degree  between  (1  +  e),_1  and  (1  +  e)'.  Also,  let 
Etj  Q  Ei  contain  the  edges  of  £,  which  have  at  least  (1  +c)-/_1  vertices.  Furthermore,  we  are  given 
that  no  vertex  in  V,  covers  more  than  a  63/(  1  +  c)  fraction  of  EtJ.  Then  we  want  to  return  a  P  C  V, 
such  that  P  covers  at  least  |P|(1  +  e),-1(l  -  66  -  2 e6)  edges,  including  at  least  a  6/2  fraction  of 
EtJ .  The  randomized  algorithm  generates  P  by  including  each  vertex  of  V,  with  probability 
pairwise  independently.  If  P  is  good,  we  return  it,  otherwise  we  try  again. 

Theorem  2.6  With  probability  at  least  1/8,  a  random  P  is  good. 

Proof:  To  show  P  covers  close  to  |P|(1  +  f)’  edges,  we  will  show  that  P  covers  many  edges  and 
that  P  is  not  too  large.  In  terms  of  our  random  variables,  |P|  =  YlkeV,  Xk,  which  we  will  denote 
by  MX).  We  will  want  /,(*)  <  ^|K|;  Le.,  fi(X)  -  £[/,(A')]  <  To  show  this  is 

likely,  we  will  apply  Chebyshev’s  inequality,  so  let  us  note  that  the  variance  of  fi{X)  is  at  most 
im-(1+g)y.  Moreovei,  let  us  observe  that  |Vf|  >  ,  since  no  vertex  in  Vi  covers  a  fraction  of 

Eij.  Therefore  we  know  that  with  probability  at  least  3/4, 

Ui{X)~E[MX)))2  <  4Var(/1m)  <  4|F,|^-^-  <  4)Vt|2 tj , 

which  implies 

\MX)-E[MX))\  <  2\Vi\ 

A  lower  bound  on  the  number  of  edges  covered  by  P  is 

E(EA'^  -  E  x«xri  =  E  d(v)x*-  E  E  x^x‘- 

e£E,  fc€e  fc,/€e  e££,  fc,/£e 

The  first  term  is  bounded  below  by  |P|(l  +  c)*-1,  which  is  at  least  | V', | ,  if  (/j(A')-£[/i(A')])2  < 

4Var(/i(A'))  as  before.  Denote  the  second  term  by  MX).  The  expectation  of  /2  is  at  most 
(1  +  c)*  2(i+t))  l^«l •  Thus  with  probability  3/4, 

2  62 

MX)  <  4E[MX)\  <  (l  +  ey^-—^\Vt\. 
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Thus  with  pro*-  bility  1/2,  both  events  hold,  so  |P|  <  |V,|  and  at  least 

(1  +  £)"'IKI(lf7P(1‘4<'MO  -  |P|(1  +  <),',LT^ 3— 

>  |F|(H-f)‘_1(l  -66  -26c) 

edges  are  covered. 

Now  we  consider  the  edges  in  Eij.  Similar  to  before,  we  can  lower  bound  the  number  of  heavy 
edges  covered  by  P  with 

£(£**  -  £  x>x‘>  =  £  £**  -  £  £  xtx,. 

e£E,  k£e  k,l€e  e€EtJ  k£e  e£EtJ  k,l£e 

We  will  use  MX)  to  denote  the  first  term  and  MX)  to  denote  the  second.  Also,  similar  to  before, 
we  will  show  that,  with  high  probability,  /3(A )  is  large  and  /4(A)  small.  To  bound  /3(A),  we  will 
use  Chebyshev’s  inequality.  To  compute  Var(/3(A)),  we  rewrite  /3( X)  as  Ylk^v,  dkXk,  where  dk  is 
the  degree  of  vertex  k  in  subl  ypergraph  //,,  —  ( l7,,  Eij).  Thus, 

Var(/3(A))  =  £  djfcVar(Afc)  <  — £  d\. 

itcv',  u-t-f;  keVi 

Since  no  vertex  covers  a  fraction  of  Eij ,  dk  <  A 3 1 J57, 7  j  for  all  k  6  V*.  Thus, 

Var(/3(^))  <  — £-i3|£y  £  d„ 
t1  +  e”  kev, 

=  6*\EXJ\2. 

Also,  £[/3(A)]  >  I  j  I  •  Therefore,  with  probability  7/8, 

(/,(A)-£[/,(A)])2  <  8Var(/3(A))  <  8^|£u|2 

which  implies  /3(A)  >  |£V,|(yA_  _  y/s62).  The  expected  value  of  /4(A)  is  at  most  |£tJ|y.  So  with 
probability  3/4, 

f4(X)  <  4£[/«(A)]  <  \Ex]\n2. 

With  probability  5/8  both  conditions  hold,  and  thus  the  number  of  edges  of  Ei:  covered  is  at  least 
\EtJ\(~--^/862  -  2S2)  >  \EtJ\6(l  -  e  -  56)  >  6-\Eij\. 

With  probability  at  least  1/8,  all  four  conditions  hold  simultaneously,  and  P  is  good.  □ 

Theorem  2.7  There  is  an  NC  algorithm  for  the  selection  procedure. 

Proof:  Observe  that  in  the  proof  above  we  actually  showed  that  the  following  four  conditions 
implj  that  P  is  good. 

(A(A) -£[/,( A)])2  <  4Var(/,(A)), 

MX)  <  4£[/2(A)], 

(/3(A)-£[/3(A)])2  <  8Var(/3(A)),  and 
fA(X)  <  4£[/4(A)]. 


To  make  our  selection  procedure  deterministic,  we  will  capture  these  four  conditions  in  a  BENEFIT 
function,  and  then  apply  Luby’s  method.  Let 


BENEFIT(X)  =  1- 


4Var  (/,(*)) 


(f2(X) 

AE[f2(X)} 


{hiX)-E[f3(X)])'> 

8Var(/3(X)) 


(WO 
4  E(MX)Y 


Clearly,  E[BENEFIT(X )]  =  1/8.  It  is  also  clear  that  if  BENEFIT(X)  >  0,  then  the  four  conditions 
above  are  satisfied.  Therefore  we  can  apply  [L2]  to  get  a  good  P.  □ 

The  number  of  processors  for  our  deterministic  selection  procedure  will  be  0(XTee£',  lel2  +  |K|2), 
which  is  at  most  n  times  the  input  size  of  0(J2eeE  \e\  +  n).  In  the  next  section,  we  show  how  to 
obtain  the  same  result  using  only  a  linear  number  of  processors. 


2.3  Achieving  a  Linear  Number  of  Processors 

In  Sections  2.1  and  2.2,  we  presented  RNC  algorithms  for  set  cover,  which  depend  on  only  pairwise 
independence.  These  algorithms  used  only  a  linear  number  of  processors.  However,  applying  Luby’s 
method  to  determinize  these  algorithms  caused  an  increase  in  the  number  of  processors,  since  we 
require  one  processor  for  each  term  of  the  BENEFIT  function,  expanded  as  a  sum  of  functions 
depending  on  one  or  two  variables  each.  The  reason  the  BENEFIT  functions  have  too  many  terms 
is  that  they  include  sums  of  all  pairs  of  a  subset  of  the  random  variables.  To  achieve  a  linear  number 
of  processors,  we  modify  Luby’s  technique:  instead  of  computing  conditional  expectations  on  the 
terms  of  the  expanded  BENEFIT  function,  we  compute  conditional  expectations  on  terms  of  the 
form  Yltjes  XiXj  directly.  (Note  that  we  can  rewrite  terms  of  the  form  (YlieS^i  -  -FE.es  AT,])2 
as  twice  the  sum  of  all  pairs  of  S  plus  <3(|S|)  other  one-variable  terms.)  This  is  similar  to  the 
technique  used  by  Luby  to  achieve  a  linear  processor  MIS  algorithm  [L2,L3]. 

We  will  demonstrate  how  to  compute  the  conditional  expectations  in  the  simple  case  where  the 
A/’s  are  unbiased.  This  easily  generalizes  to  the  case  where  the  AT,’s  are  identically  distributed  but 
biased.  To  understand  this,  we  must  review  Luby’s  method.  To  find  an  X  with  BENEFIT(X)  > 

E[BENEFIT(X)\,  Luby  lets  X,  =  Ylj= i  *j^j  +<*>/+ 1  mod  2,  where  ijt^  ■  •  •  t;  is  the  binary  expansion 

of  i.  He  then  sets  one  bit  of  w  at  a  time  in  such  a  way  as  to  maximize  the  conditional  expectation 
of  BENEFIT(X). 

We  provide  a  way  to  compute 

E[  ^  X{Xj  |  Wi  =Sl  ,U>2  =  S2,  .  .  .  ,W(  =  S(]. 

>.j€S 

If  t  =  /  +  1,  then  we  know  all  the  A',’s,  and  jeS  XtX3  =  •  Otherwise,  we  partition  S  into 

sets  Sa  -  {i  6  S  |  it+\  ■  ■  ■  ii  =  a}.  We  further  partition  Sa  into  5Ot0  =  {i  -  Sa  |  m°d  2  = 

0}  and  5q,i  =  Sa  —  Sa< o-  Note  that  given  uq  =  S) , . . .  ,u><  - -  , 

1.  Pr[Xi  =  0]  =  Pr[X,  =  1]  =  1/2, 

2.  if  i  6  50,j,  and  i’  £  SaJ »,  then  A',  -  A',-  iff  j  =  j\  and 

3.  if  i  £  SQ  and  i'  e  Sa>,  where  a  /  o',  then  Pr{ X,  =  A',.]  =  Fr[A',  ^  A',-]  =  1  /2. 

Therefore,  conditioned  on  =sj, . . .  ,u;(  =  st. 


ElZx.Xi)  =  CE  £  £  x,x,) 

'.j€5  or  i,j£Sa  a,a' i(zSa  j€Sai 


=  E£[.E  xix>+  E  xixi+  E  E  x^ i  +  Ejim-i 

a  tijeSa.o  «'€Sa,0  jcs0l,  o, a' 

=  £  [iCWsCW^M  +  i?^*1  £  |i“'1 

=  |z  [CV1)  +  CV)]  +  sEl^KI^I  -  ls«D- 

Since  there  are  at  most  |5|  non-empty  Sa' s,  we  can  compute  this  using  0(|Sj)  processors. 

2.4  Weighted  Set  Cover 

In  this  section,  we  consider  the  problem  of  weighted  set  cover:  given  a  hypergraph  H  =  (V',  E)  and 
a  weigh1  function  w  :  V  -*  71.  and  letting  w(S )  ;=  Ylves  w(v)i  find  a  subset  R  C  V  which  covers  H 
such  that  w(R)  is  small.  Not"  that  if  w(v)  =  1  for  all  v  6  V,  this  becomes  normal  set  cover.  Let 
r*  be  the  minimum  weight  of  any  fractional  cover  of  H .  Then, 

Theorem  2.8  (C)  There  is  a  polynomial  time  algorithm  which  outputs  a  cover  R  of  H  with 
w(R)  <  t*  log  A,  where  A  is  the  maximum  degree  of  H. 

Chvatal’s  algorithm  for  weighted  set  cover  is  simply  the  greedy  set  cover  algorithm  of  Johnson  and 
Lovasz  modified  to  always  pick  the  vertex  with  minimum  cost  per  edge  covered,  i.e.  minimum 
We  can  combine  this  technique  with  our  algorithm  of  Section  2.2  to  get 

Theorem  2.9  If  max„€v  w{v)  <  2logC  n  minvev  w(v),  then  there  is  an  NC  algorithm,  using  a  linear 
number  of  processors,  which  outputs  a  cover  R  of  H  with  w(R)  <  r*  log  A. 

Proof  Sketch:  Modify  the  algorithm  of  Section  2.2  so  that  a  phase  considers  only  those  vertices 
with  within  (l  +  «)  of  the  minimum  value  still  left.  The  restriction  of  the  weight  function 
ensures  that  there  will  be  only  poly-log  many  phases.  The  structure  of  the  subphases  is  as  before. 
A  slightly  more  complicated  BENEFIT  function  is  required,  since  the  vertices  can  have  widely 
varying  degrees,  but  it  is  of  the  same  basic  form.  The  proof  of  Chvata.1  can  be  modified  to  show 
that  this  algorithm  performs  near-optimally  as  stated.  □ 

3  Application  to  Learning  Theory 

In  this  section,  we  apply  the  set  cover  algorithm  of  Section  2.2  to  parallel  learning,  a  field  first 
explored  by  Vitter  and  Lin  [VL].  In  particular,  we  provide  an  NC  algorithm  for  learning  in  concept 
classes  that  are  formed  by  taking  either  finite  unions  or  finite  intersections  of  a  fixed  base  class 
of  finite  VC  dimension.  For  example,  convex  polygons  are  defined  by  finite  intersections  of  half¬ 
planes.  We  show  that  classes  of  this  type  are  NC-learnable  whenever  there  is  an  NC  algorithm 
for  finding  a  consistent  hypothesis  in  the  base  class  (Theorem  3.8).  In  obtaining  this  result,  we 
employ  our  parallel  set  cover  algorithm  to  obtain  a  sufficiently  simple  explanation  of  the  sample 
data.  Blumer,  Ehrenfeucht,  Haussler,  and  Warmuth  [BEHW]  had  previously  solved  this  problem 
in  a  polynomial-time  model. 

The  only  previous  parallel  work  on  this  subject  is  in  [VL].  They  give  an  RNC  algorithm  for 
learning  s-fold  unions  of  axis-parallel  rectangles  in  the  plane,  by  using  a  randomized  set  cover 
algorithm  which  is  heavily  tied  to  their  specific  problem.  Our  general  techniques  apply  directly  to 
this  problem.  In  addition,  while  [VL]  may  produce  a  hypothesis  with  up  to  s4  logm  rectangles,  our 
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method  will  always  produce  one  with  at  most  slogm  rectangles,  due  to  the  performance  guarantee 
of  our  set  cover  algorithm. 

The  following  definitions  are  adapted  from  [BEHW,  VL]: 

Definition  3.1  Fix  a  domain  X.  A  concept  class  is  a  nonempty  set  C  C  2*  of  concepts.  In  this 
paper,  it  is  assumed  that  X  is  a  fixed  set,  either  finite,  countably  infinite,  [0,  l]n,  or  En  for  some 
n  >  1. 

Definition  3.2  Given  a  nonempty  concept  class  C  C  2X  and  a  set  of  points  S  Q  X,  IIc(5) 
denotes  the  set  of  all  subsets  of  5  that  can  be  obtained  by  intersecting  5  with  a  concept  in  C;  i.e., 
ncC-S)  =  {5  n  c  |  c  €  C }.  If  IIcC-S1)  =  2s,  then  we  say  that  S  is  shattered  by  C.  The  Vapnik- 
Chervonenkis  (VC)  dimension  of  C  is  the  cardinality  of  the  largest  finite  set  of  points  5  C  X  that 
is  shattered  by  C.  If  arbitrarily  large  finite  sets  are  shattered,  the  VC  dimension  of  C  is  infinite. 

Definition  3.3  For  any  integer  m  >  0,  Ilc(m)  =  max(|IIc'(«S,)|)  over  all  S  C  X  of  cardinality  m. 

Using  this  notation,  the  VC  dimension  of  C  can  be  defined  as  the  largest  integer  d  such  that 
Ilc(of)  =  2d,  or  infinity. 

Definition  3.4  The  length  of  concept  c,  denoted  |c),  is  the  number  of  bits  required  to  write  c  in 
some  standard  encoding. 

Definition  3.5  Let  C  be  defined  as  above.  We  say  that  C  is  NC-leamable  if  there  exists  an  NC 
algorithm  A  that  takes  as  input  a  sample  of  a  concept  in  C,  outputs  a  hypothesis  in  C,  and  has  the 
property  that  for  all  0<e,6  <  1,  and  s  >  1  there  exists  a  sample  size  m(e,6,s),  polynomial  in  1/e, 
1/6,  and  s,  such  that  for  all  target  concepts  c  €  C  with  |c|  <  s,  and  all  probability  distributions 
P  on  X,  given  a  random  sample  of  c  of  size  m(e,  6,  s)  drawn  independently  according  to  P,  A 
produces,  with  probability  at  least  1  —  6,  a  hypothesis  h  €  C  that  has  error  at  most  e  such  that 
P[c  0  h],  where  0  denotes  the  symmetric  difference. 

Definition  3.6  Let  C  C  X  be  a  concept  class.  By  U(C )  we  denote  the  closure  of  C  under  finite 
unions,  i.e., 

U(C )  =  {ci  U  •  •  •  U  c,  |  s  >  1  and  c,  €  C,  1  <  *  <  s}. 

Similarly,  1(C)  denotes  the  closure  of  C  under  finite  intersections. 

Definition  3.7  Let  C  be  a  concept  class.  An  NC  hypothesis  finder  for  C  is  an  NC  algorithm  that, 
given  a  sample  of  a  target  concept  C,  returns  a  hypothesis  in  C  that  is  consistent  with  the  sample. 
Note  that  we  do  not  consider  randomized  hypothesis  finders  here.  The  consistency  problem  for  C 
is  the  problem  of  determining  if  there  is  a  concept  in  C  that  is  consistent  with  a  given  sample  over 
X. 


The  existence  of  an  NC  hypothesis  finder  for  C  implies  that  the  consistency  problem  for  C  is 
in  NC. 

Theorem  3.8  Let  C  be  a  concept  class  with  VC  dimension  d  <  oo  such  that  there  exists  an  NC 
hypothesis  finder  for  C .  Then  U(C)  (resp.  1(C))  is  NC-leamable. 

We  will  give  the  proof  below.  First  we  handle  some  preliminaries. 
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Definition  3.9  Let  C  be  a  concept  class  defined  on  domain  X.  Let  A  be  an  NC  algorithm  that, 
given  a  sample  of  a  concept  in  C,  produces  a  consistent  hypothesis  in  C.  For  every  s,m  >  1,  let 
Sc,s,m  denote  the  set  of  all  m-samples  of  concepts  c  £  C  such  that  |c|  <  s.  Let  Cf<m  C  C  denote 
the  A-image  of  i-e.,  the  set  of  all  hypothesis  produced  by  A  when  A  is  given  as  input  an 

m-sample  of  a  concept  c  £  C  with  |c|  <  s.  We  will  call  Cfm  the  effective  hypothesis  space  of  A  for 
target  complexity  s  and  sample  size  m.  We  say  A  is  an  NC- Occam  algorithm  for  C  if  there  exists 
a  polynomial  p(s)  and  a  constant  a,  0<a<  1,  such  that  for  all  s,  m  >  1  the  VC  dimension  of  C^m 
is  at  most  p(s)ma. 

Theorem  3.10  (BEHW)  Let  C  be  a  concept  class  with  a  given  concept  complexity  measure. 

1.  If  there  is  an  NC-Occarr  algorithm  for  C  then  C  is  NC-leamable. 

2.  L(  t  A  be  an  NC-Occani  algorithm  for  C  with  effective  hypothesis  space  C*m  for  target  com¬ 
plexity  s  and  sample  size  m.  Then,  if  the  VC  dimension  of  Cfm  is  at  most  p(s)(\ogm)!  for 
some  polynomial  p(s)  >  2  and  l  >  1,  then  A  is  an  NC  learning  algorithm  for  C  using  sample 
size 

log  8(2/  +  2),+1p(s)  /+1~ 

Lemma  3.11  (BEHW)  Let  C  C  2X  be  a  concept  class  of  finite  VC  dimension  d  >  1.  For  all 
s  >  1,  let  U,(C)  =  {Ui=i  Ci  I  Ci  £  C,  1  <  i  <  a}  (resp.  I,(C )  =  {f)Li  c,  |  Cj  G  C,  1  <  i  <  s}). 
Then  for  all  s  >  1,  the  VC  dimension  of  Cs  is  less  than  2dslog(3s). 

Proposition  3.12  (BEHW)  If  the  VC  dimension  of  H  is  d  >  0,  then  nj/(m)  <  md  +  1. 

We  now  use  Theorem  3.10  to  demonstrate  the  learnability  of  many  concept  classes  of  the  form 
U(C )  and  1(C)  for  C  of  finite  VC  dimension. 

Lemma  3.13  If  C  has  finite  VC  dimension  d  <  oo  and  the  consistency  problem  for  C  is  in  NC, 
then  for  any  finite  set  S  C  X ,  the  sets  of  IIc(S)  can  be  listed  in  NC. 

Proof  Assume  S  =  {®i,. .  To  produce  a  list  L  of  II c(S),  we  proceed  as  follows.  If  m  =  1, 

we  use  the  consistency  algorithm  for  C  to  check  if  {xi}  is  consistent,  and  if  so  return  {0,  {xi}}, 
otherwise  return  {0}.  If  m  >  1,  then  we  recurse  (in  parallel)  to  get  L\  —  IIc({xi, . .  .,x^m/2j}) 
and  Li  =  IIc({x|^m/2j+i,  •  •  ■ ,  *m})-  Then  in  parallel  for  all  pairs  T\  €  L\,  T 2  G  Li,  we  check  the 
consistency  of  T\  U  T2,  and  return  {7\  U  Ti  |  T\  £  L\,  Ti  £  Li,  T\\jTi  consistent}.  The  depth  of 
the  recursion  is  logm;  furthermore,  since  by  Proposition  3.12  the  size  of  IIc(5)  is  at  most  |5|d  +  1, 
we  run  the  consistency  algorithm  at  most  0(m7d)  times  in  parallel  at  each  level  of  the  recursion. 
Therefore,  this  algorithm  is  in  NC.  □ 

Proof  of  Theorem  3.8  We  consider  only  the  case  U(C),  the  other  case  being  similar.  Let  S  be 
the  set  of  points  in  an  m-sample  of  a  target  concept  c  in  U(C).  Our  strategy  will  be  to  find  a 
hypothesis  consistent  with  5  that  is  formed  from  the  union  of  relatively  few  concepts  in  C;  i.e.  not 
many  more  than  s.  This  problem  can  be  formulated  as  a  set  cover  problem.  The  set  to  be  covered 
is  the  set  of  positive  points  of  5  and  the  sets  allowed  in  the  cover  are  the  elements  of  IIc(5)  that 
contain  only  positive  points. 

By  Lemma  3.13,  we  can  construct  IIc(S)  in  NC.  Then,  in  parallel,  we  can  easily  compute  A  = 
{T  £  ITc(5)  |  T  contains  only  positive  points  of  5}  and  P  =  {x  £  S  |  x  is  a  positive  example}. 
We  can  then  apply  the  set  cover  algorithm  of  Section  2.2  to  obtain  a  cover  of  size  (?(s!ogm).  For 


m  =  max 


4  ,  2  2 1+4 p(s) 

-log-,  - 

€  0  e 
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each  set  in  the  cover,  we  can  label  the  other  points  negative  and  run  the  NC  hypothesis  finder  for 
C  to  produce  a  hypothesis  in  C  that  contains  only  these  points  of  the  sample.  Taking  the  union  of 
these  concepts,  we  obtain  a  hypothesis  in  U(C).  Call  this  algorithm  A. 

We  have  shown  that  A  is  in  NC  and  that  given  any  m-sample  of  a  concept  c  in  U(C)  with  |c|  <  s, 
A  produces  a  consistent  hypothesis  h  for  this  sample  with  |h|  <  O(slnm).  Hence  the  effective 
hypothesis  space  U(Cf  m)  of  A  for  target  complexity  s  and  sample  size  m  contains  only  hypotheses 
such  that  |/i|  <  O(slnm).  By  Lemma  3.11,  the  VC  dimension  of  U{Cfm)  is  0(slog(m)(logs  + 
loglogm)).  Hence,  A  is  an  NC-Occam  algorithm  for  U(C)  and  thus  by  Theorem  3.10,  U(C)  is 
NC-learnable.  □ 

4  Application  to  Computational  Geometry 

Randomization  is  a  tool  which  has  been  used  extensively  to  construct  algorithms  in  computational 
geometry  [CU,  C12,  C13,  EGS,  HW,  RSI,  RS2].  Recently,  Chazelle  and  Friedman  [CF]  showed  how 
to  make  most  of  the  randomized  algorithms  used  in  computational  geometry  deterministic,  albeit 
at  the  cost  of  a  substantial  increase  in  running  time.  Our  results  can  be  used  to  put  many  of 
these  algorithms  into  deterministic  NC,  with  the  number  of  processors  required  being  equal  to  the 
running  time  of  Chazelle  and  Friedman’s  algorithm.  We  will  very  briefly  outline  their  technique 
and  show  some  of  the  ideas  needed  to  make  the  geometric  applications  parallel. 

To  remove  randomization,  Chazelle  and  Friedman  [CF]  use  balanced  set  cover.  The  basic 
technique  used  by  Chazelle  and  Friedman  is:  First,  construct  a  hypergraph  which  captures  the 
behavior  of  the  randomized  algorithm.  Next,  find  a  balanced  set  cover  for  this  hypergraph.  Finally, 
use  the  set  cover  found  in  the  algorithm  to  decide  which  elements  to  use  as  the  randomly  chosen 
subset  in  the  randomized  version  of  the  algorithm.  For  many  problems,  the  first  and  third  steps 
are  easy  to  do  in  NC.  The  second  step  is  simply  Section  2.1  of  our  paper.  In  general,  the  number 
of  processors  required  in  steps  1  and  3  is  easily  seen  to  be  less  than  the  number  required  in  step  2, 
because  of  the  large  size  of  the  hypergraph  constructed  by  Chazelle  and  Friedman. 

All  the  algorithms  discussed  in  [CF]  use  a  variation  of  a  technique  which  they  call  probabilistic 
divide-and-conquer.  The  example  they  give  is:  given  n  hyperplanes  in  d- space  and  a  parameter 
r,  find  a  simplicial  cell  decomposition  of  d-space  such  that  each  cell  intersects  O(n(logr)/r)  hy¬ 
perplanes.  This  decomposition,  or  a  similar  one,  is  used  in  many  range  search  problems  [CU,  C12, 
C13,  EGS,  HW].  It  is  generally  used  to  construct  a  data  structure  which  consists  of  a  logn  depth 
tree,  with  each  node  of  the  tree  being  a  simplex,  and  the  children  of  a  node  being  the  simplices 
determined  by  a  random  sample  of  the  hyperplanes  intersecting  that  node.  Since  the  tree  has  only 
logn  depth,  if  there  is  an  NC  algorithm  for  constructing  a  level  of  the  tree,  i.e.,  for  finding  this 
simplicial  cell  decomposition  of  d-space  such  that  each  cell  intersects  few  hyperplanes,  then  building 
the  entire  tree  is  in  NC. 

Once  we  have  an  NC  balanced  set  cover  algorithm,  we  still  need  an  NC  algorithm  for  con¬ 
structing  the  hypergraph  which  is  to  be  covered.  In  [CF],  a  universal  triangulation  scheme  is  used 
to  construct  this  hypergraph.  A  triangulation  of  an  arrangement  of  hyperplanes  in  d-spaces  is  a 
decomposition  of  the  cells  of  that  arrangement  into  simplices.  By  universal  it  is  meant  that  there 
exists  a  canonical  scheme  for  dividing  a  cell  of  an  arrangement  into  simplices  which  depends  only 
on  that  cell,  and  not  on  the  whole  arrangement.  The  very  fact  that  their  triangulation  scheme  is 
universal  (which  was  necessary  for  their  algorithm)  means  that  this  triangulation  is  easy  to  achieve 
in  parallel,  as  it  only  depends  on  local  information. 

One  final  thing  to  do  is,  given  a  set  of  hyperplanes  in  d-dimensionaJ  space,  construct  in  NC  the 
arrangement  of  cells  produced  by  these  hvperplanes.  In  [CF]  this  is  done  by  appealing  to  [EOS], 
which  is  an  inherently  sequential  algorithm.  One  can  construct  the  arrangement  in  NC  by  using  an 


alternative  algorithm  which  has  d  stages.  At  the  fcth  stage,  we  construct  the  arrangement  induced 
by  the  hyperplanes  in  all  the  A:-flats  of  the  arrangement.  We  do  this  by  using  the  arrangements 
induced  by  the  hyperplanes  in  the  (&  —  l)-flats,  which  we  constructed  in  the  previous  stage.  For 
each  &-flat  we  will  construct  a  graph  which  will  give  us  the  cells  of  the  arrangements  in  the  flat  as 
connected  components  of  the  graph.  For  each  (fc-l)-face  in  a  (fc-l)-flat  contained  in  the  fc-flat,  we 
associate  two  vertices  of  the  graph.  One  vertex  will  correspond  to  each  “side”  of  the  (k  —  l)-face. 
We  then  construct  the  graph  by  connecting  a  vertex  corresponding  to  a  (k  —  l)-face  to  vertices 
corresponding  to  (k— l)-faces  which  are  in  a  different  (k—  l)-flat  and  which  see  the  neighboring  faces 
of  a  cell  of  the  arrangement  induced  in  the  A:-flat.  It  is  not  hard  to  see  that  this  can  be  done  in  NC. 
The  connected  components  of  this  graph  will  give  the  fc-faces  of  the  arrangements  in  the  fc-flats, 
and  as  we  know  which  (k  —  l)-faces  are  sub-faces  of  a  given  face,  this  gives  us  the  arrangement. 
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